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Abstract 

Several real-world networks exhibit a complex structure and are formed due to strategic interactions among rational and 
intelligent individuals. In this paper, we analyze a network formation game in a strategic setting where payoffs of individuals 
depend only on their immediate neighbourhood. We call these payoffs as localized payoffs. In this network formation game, the 
payoff of each individual captures (1) the gain from immediate neighbors, (2) the bridging benefits, and (3) the cost to form links. 
This implies that the payoff of each individual can be computed using only its single-hop neighbourhood information. Based on 
this simple and appealing model of network formation, our study explores the structure of networks that form, satisfying one or 
both of the properties, namely, pairwise stability and efficiency. We analytically prove the pairwise stability of several interesting 
network structures, notably, the complete bi-partite network, complete equi-k-partite network, complete network and cycle network, 
under various configurations of the model. We validate and further extend these results through extensive simulations. We then 
characterize topologies of efficient networks by drawing upon classical results from extremal graph theory and discover that the 
Turan graph (or the complete equi-bi-partite network) is the unique efficient network under many configurations of parameters. 
We next examine the tradeoffs between topologies of pairwise stable networks and efficient networks using the notion of price 
of stability, which is the ratio of the sum of payoffs of the players in an optimal pairwise stable network to that of an efficient 
network. Interestingly, we find that price of stability is equal to 1 for almost all configurations of parameters in the proposed 
model; and for the rest of the configurations of the parameters, we obtain a lower bound of 0.5 on the price of stability. This 
leads to another key insight of this paper: under mild conditions, efficient networks will form when strategic individuals choose 
to add or delete links based on only localized payoffs. 
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r/3 | I. Introduction 

Several real world networks such as the Internet, social networks, organizational networks, biological networks, food webs, 
co-authorship networks, citation networks, and many more exhibit complex network structures. Complex networks, generally 
modeled as graphs in most of the mathematical literature, have been extensively studied in recent years and they are pervasive 
in today's science and technology (HI 0; S H)- Studying the properties of the complex network structures helps to understand 
the underlying phenomena and developing new insights into the system such as small-world phenomena, scale-free topology, 
O ■ and structural holes (H S H 5 1)- 

Complex networks have also been studied extensively in the social sciences dH § OH H) (and the references therein). 
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These studies reveal that complex social networks play an important role in spreading information (1.2; 12; 14; 15; 17). 
Individuals that participate in the process of information dissemination in such networks receive various kinds of social and 
C^l . economic incentives and at the same time they also incur costs in forming and maintaining the contacts (i.e. links) with other 
\ I 'individuals in terms of time, money, and effort. For this reason, individuals do act strategically while selecting their neighbors. 
J> , Thus, in several contexts, the behavior of the system is driven by the strategic actions of a large number of individuals, each 
K^j motivated by self-interest and optimizing an individual objective function. Thus, it is important to study the dynamics of 
. strategic interaction among the individuals in complex social networks in order to understand how such networks form and 
& this is the primary motivation for this paper. 

Many recent studies on network formation have used game theoretic approaches (H3 US IzH 1^5 IHL based 



on the observation that individuals are strategic and are interested in maximizing their payoffs from the social interactions. 
These models capture the strategic interactions among individuals and the analysis of these models satisfactorily deduces the 
topologies of equilibrium networks. In this domain, networks that are enforced by a central authority are known as efficient 
networks. Understanding the compatibility between equilibrium networks and efficient networks has been the primary focus 
of research in network formation Q27 ; ljj| 2^; 

13 13 W, [U E2). 



The crux of most of the models for network formation in the literature (27J [33fc |34j; |35|; |36t |30t 1311) is the underlying strategic 
form game where the players, strategies, and utilities (also termed as payoffs) are defined as follows: (i) the individual agents 
in the complex network are the players, (ii) the strategy of each agent is a subset of other agents with which it wishes to form 
links, and (iii) the utility of each agent depends on the structure of the network. 

Another key aspect of most of the existing work in the literature is that the process of network formation is modeled in 
a decentralized fashion where the individuals in the network take autonomous decisions regarding whether to form or delete 

*Any correspondence can be addressed to rohithdv@gmail.com 
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links with other agents. However, most of these models require the agents to know the complete global structure (that is, 
information about all nodes as well as all the links between the nodes) of the network to compute their respective payoffs. In 
many practical scenarios, this will be a very demanding requirement making the utility computation a cumbersome and often 
intractable task. Moreover, empirical evidence has clearly shown that a significant fraction of the perceived social and 



economic benefits for the individuals is derived from their 1-hop or 2-hop neighborhood. Motivated by this, a few models of 
network formation have been investigated that use local information (such as information about 1-hop or 2-hop neighborhood). 
For instance, Kleinberg and co-authors ([38) propose a network formation model where the utility function of each node is 
based on 2-hop neighborhood information. However, in several real-world examples, we observe that complete knowledge 
about 2-hop information may be infeasible and nodes may need to get a reasonably accurate estimate of their payoffs by 
using just their immediate neighborhood (or 1-hop) information. In fact, we can observe such constraints in several real-world 
examples like distributed sensor networks and real-life social networks. In distributed sensor networks, coalitions of sensors can 
work together to track targets of interest and each sensor knows only its immediate neighborhood. In real-life social networks, 
it may not be possible for an individual to know all the friends of his/her immediate friends. Note that individuals can know 
partial information about their 2-hop neighborhood (i.e. friends of friends); however, this partial information is inadequate to 
accurately compute the payoffs of the individuals. Hence, in such settings, it becomes important to study the network formation 
process using only single hop neighborhood information and this is the primary motivation behind our work in this paper. 

In this paper, we explore a novel model of network formation process from an economic perspective in which individuals 
derive payoffs (consisting of benefits from immediate neighbors as well as structural holes and the costs to form links) using 
purely local neighbourhood information and we refer to this setting as network formation with localized payoffs. The primary 
contribution of our work is to come up with a game theoretic model in the above setting and study the topologies of the 
equilibrium networks and efficient networks that emerge in such a network formation process. We next examine the tradeoffs 
between topologies of equilibrium networks and efficient networks using the notion of price of stability $3$) . Informally, price 
of stability is the ratio of the sum of payoffs of the players in an optimal (in terms of sum of payoffs of the players) pairwise 
stable network to that of an efficient network. Interestingly, we find that price of stability is 1 for almost all configurations of 
the parameters in the proposed model; and for the rest of the configurations of the parameters in the proposed model, we obtain 
a lower bound of 0.5 on price of stability. This indicates that, when some mild conditions are satisfied, efficient networks will 
form when strategic individuals choose to add or delete links based on localized payoffs. 

We note that our model assumes that a link forms with the consent of both the individuals (refer to Section [II]), as social 
contacts usually emer ge i n this manner. This assumption is widely considered in several models of network formation in 
the literature (129c l33b 128c l39c 140c klh . In such situations, an appropriate choice for the notion of equilibrium is pairwise 
stability (133b . Informally, we call a network pairwise stable if no agent can improve its utility by deleting any link and no two 
unconnected individuals can form a link to improve their respective payoffs. We call a network efficient if the sum of payoffs 
of the individuals is maximal. In this framework, our objective is to investigate the tradeoff between topologies of pairwise 
stable and efficient networks. In the rest of the paper, we use the terms graph and network interchangeably. We thus use the 
terms nodes and individuals interchangeably throughout the paper. As a game-theoretic approach is used, we sometimes use 
the terms players and individuals interchangeably throughout the paper. 

A. Relevant Work 

The field of network formation has been extensively studied in diverse fields such as sociology, physics, computer science, 
economics, mathematics and biology dl^ 1^3 iSt S3 S3 1^5 iSll [3^ S3 S3 |H3 

25 : 26h . In this section, we have included a discussion of the models that are most relevant to our work. 



The modeling of strategic formation in a general network setting was first studied in the seminal work of Jackson and 
Wolinsky (13 3l) . They basically consider a value function and an allocation rule model where the value function defines a value 
to each network and the allocation rule distributes this value to the nodes in the network. They investigate whether efficient 
networks will form when self-interested individuals can choose to form links and/or break links. The authors define two stylized 
models. For these models, the authors observe that for high and low costs the efficient networks are pairwise stable, but not 
always for medium level costs. They also examine the tension between efficiency and stability and derive various conditions 
and allocation rules for which efficiency and pairwise stability are compatible. An important feature their model does not 
capture is that of the intermediary benefits that nodes gain by being intermediaries lying on the paths between non-neighbor 
nodes. In particular, they do not capture the benefits due to structural holes. 

Hummon (28) carries out several interesting investigations to unravel more specific topologies using a specific model proposed 



by Jackson and Wolinsky (33). Two different agent-based simulation approaches, the multi-thread model and the discrete event 
simulation model, are used in the analysis done by Hummon (1281) to explore the dynamics of network evolution based on a 
model proposed in Jackson and Wolinsky (I33h . Hummon identifies certain pairwise stable structures that are more specific than 
those anticipated by the formal analysis of Jackson and Wolinsky (33). Doreian (29) explores the same issue in a systematic 
manner and establishes the conditions under which different pairwise structures are generated. Some gaps in the analysis of 
Doreian J29h are addressed by Xie and Cui (4(J 41). 
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Jackson (1391) reviews several models of network formation in the literature with an emphasis on the tradeoffs between 
efficiency with stability. This work also studies the relationship between pairwise stable and efficient networks in a variety of 
contexts and under three different definitions of efficiency. A later paper by Jackson (1471) presents a family of allocation rules 
(for example, networkolus) that incorporate information about alternative network structures when allocating the network value 
to the individual nodes. The author provides a general method of defining allocation rules in network formation games. 

Goyal and Vega-Redondo (1431) propose a non-cooperative game model in which a node i can benefit from serving as an 
intermediary between a pair of nodes x and y. In their model, a node i could lie on an arbitrarily long path between x and y. 
The authors assume, however, that the benefits from farther nodes are not subject to decay. They also assume that the benefit 
of communication between any pair of nodes is always 1 unit. This 1 unit is distributed to the two communicating nodes and 
only to certain so called essential nodes d43l) on the paths between the two communicating nodes. In this setting, the authors 
show that a star graph is the only non-empty robust equilibrium graph. The authors also study the implications of capacity 
constraints in the ability of individual nodes to form links to other nodes and show that a cycle network emerges. 

Ramasuri and Narahari (51) propose a generic model of network formation that essentially builds on the model of Jackson- 
Wolinsky This model simultaneously captures four key determinants of network formation: (i) benefits from immediate 
neighbors through links, (ii) costs of maintaining the links, (iii) benefits from non-neighboring nodes and decay of these benefits 
with distance, and (iv) intermediary benefits that arise from multi-step paths. The authors (15 11) analyze the proposed model to 
determine the topologies of stable and efficient networks. 

The aforementioned models of network formation have the limitation that each individual (or node) needs to know global 
information about the structure of the network in order to compute its utility. A few recent models ( 42 ; 52 ; 38|) in the literature 
make an attempt to overcome the above limitation. 

> Buskens and van de Rijt (1421) propose a model that requires each individual agent to know just its immediate neighbors (or 
1-hop neighborhood) to optimize its own utility. However, the model captures only the cost to nodes and ignores various 
benefits that nodes can derive from the network such as direct benefits from the neighbors and the bridging benefits. 

> Arcaute, Johari, and Mannor (15 2l) study the myopic dynamics in network formation games. A key aspect of the dynamics 
studied in this model is the local information and the authors show that these dynamics converge to efficient or near efficient 
outcomes. However, the model does not characterize the topologies of equilibrium and efficient networks. Moreover, the 
model works with Pareto efficiency whereas we work with a more natural notion of efficiency, namely maximizing the 
sum of payoffs of all the nodes. 

• Kleinberg and co-authors (i38h characterize the structure of stable networks with Nash equilibrium as the notion of 
stability. The authors propose a polynomial time algorithm for a node to determine its best response in a given graph as 
nodes can choose to link to any subset of other nodes. They also show that stable networks have a rich combinatorial 
structure. However, the model needs each individual agent to know its 2-hop neighborhood (the set of all individuals 
that are reachable within two hops) to compute and optimize its own utility. The model works with Nash equilibrium 
while our proposed model works with the more natural notion of pairwise stability as the notion of equilibrium. Also, 
our model considers only single hop neighbourhood which is more appropriate for certain kinds of social networks as 
already explained. Moreover, the model (I38l) does not study the tradeoff between the topologies of stable networks and 
the topologies of efficient networks. 



B. Our Contributions 

To the best of our knowledge, our current study is the first one to comprehensively explore the tradeoff between pairwise 
stability and efficiency using the notion of price of stability in the context of strategic network formation with localized payoffs, 
while taking into account several key factors such as link costs, link benefits, and bridging benefits. The following are the 
specific contributions of our paper. 

• Section^ An Elegant Model for Network Formation with Localized Payoffs: We propose a strategic form game to model 
the process of network formation with localized payoffs and we term the game as network formation (game) with localized 
payoffs (NFLP). The utility of each player in the proposed game takes into account not only the benefits (i5) that arise 
from routing information to and from its neighbors but also the cost (c) to maintain a link to each of its neighbors. 

• Section IT//} Analytical Characterization of Topologies of Pairwise Stable Networks: We first analytically characterize the 
topologies of the pairwise stable networks using the NFLP model. Some of the networks that we consider for analysis 
include the cycle, star, complete and null networks. In addition, we also derive pairwise stability conditions for certain 
classes of k-partite networks namely bipartite complete networks, complete equi-tri-partite networks and complete equi-k- 
partite networks. We note that our findings extend the possible topologies for pairwise stable networks compared to that 
of other models in the literature. 

• Section \LV\ Simulation of Network Formation Process and Additional Insights: Next, we simulate strategic dynamics 
in NFLP to understand how pairwise stable networks evolve over time. Our simulation results validate our analytical 
deductions and also reveal additional interesting insights on the topologies of pairwise stable networks. In addition, we study 
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the emergent pairwise stable topologies during the network formation process and study the evolution of pairwise stable 
network and its properties like the clustering co-efficient, convergence time, etc. over different configuration parameters. 

• Section Analytical Characterization of Topologies of Efficient Networks: Next, we analytically characterize topologies 
of efficient networks by drawing upon classical results from extremal graph theory. Our work leads to sharp deductions 
about the efficient networks in NFLP. A striking discovery of our study here is that the equi-bi-partite graph (popularly 
known as the Turan graph) emerges as the unique efficient network under many regions of values of S and c. 

• Section\VI$ Price of Stability Investigations: The quality of optimal (in terms of the sum of payoffs of the individuals in 
the network) pairwise stable networks is best understood through the notion of price of stability (PoS). PoS allows us to 
explore the middle ground between centrally enforced solution and completely unregulated anarchy d35l) . In most real- 
world applications, the nodes are not completely unrestricted in their strategic behavior but rather agree upon a prescribed 
equilibrium solution. In such scenarios, the prescription can be chosen to be the best equilibrium thus making the price 
of stability an important issue to study. We study the PoS in NFLP to reveal tradeoffs between pairwise stable networks 
and efficient networks. Intriguingly, we find that PoS is 1 for almost all configurations of 5 and c. For the remaining 
configurations of S and c, we obtain a lower bound of h on PoS. This implies, under mild conditions on 5 and c, that 
the proposed NFLP model produces pairwise stable networks that are efficient. 



II. A Model for Network Formation with Localized Payoffs 



We model network formation using a strategic form game (1181) . We consider a network setup with n players denoted by 



N = {1,2,..., n). A strategy Sj of a player i is any subset of players with which the player would like to establish links. We 
assume that the formation of a link requires the consent of both the players. Assume that Si is the set of strategies of player 
i. Let s — (si, S2, ■ ■ ■ , s n ) be a profile of strategies of the players. Also let S be the set of all such strategy profiles. Each 
strategy profile s leads to an undirected graph and we represent it by G(s). If there is no confusion, we just use G. If players 
x and y form a link (x, y) in a graph g, then we represent the new graph by g + (x, y). We assume that players in the network 
communicate using shortest paths - this is a standard assumption used in the literature for ease of modeling. In the rest the 
paper, we use the terms players, nodes, and agents interchangeably. 

Degree of Node: The degree dj, of node i represents the number of neighbors of node i. 

Costs: If nodes i and j are connected by a link, then we assume that the link incurs a cost c G (0, 1) to each node. That is, 
if the degree of node i is di, then node i incurs a cost of cdi. 

Benefits from Immediate Neighbors: Assume that i5 G (0, 1). If node i is connected to a node j by a direct link, then we 
assume that node i gains a benefit of 5. That is, if the degree of node i is di, then node i gains a benefit of Sdi from its 
immediate neighbors. 

Bridging Benefits: Consider a node i. Assume that nodes j and k are two neighbors of node i such that j and k are not 
connected by a direct link. Suppose that nodes j and k communicate using the length 2 path through node i, then (i) we assume 
that a benefit of 8 2 arises due to this communication, and (ii) we also assume that the benefit 5 2 entirely goes to node i. We 
refer to 5 2 as the bridging benefit to node i. The main motivation for this kind of bridging benefits is by sociological studies 
suggesting that in practice most of the bridging benefits arise from bridging the communication between pairs of non-neighbor 
nodes in the network J53lh 

In this framework, we define the utility of node i such that it depends on the benefits from immediate neighbors, the costs 
to maintain links to these immediate neighbors, and the bridging benefits. More formally, for any i G N, the utility itj of node 
i in an undirected graph G is defined as follows: 

Ui (G) =d i {5-c) + d i (l- -p^j 62 (D 

where o~i is the number of links among the neighbors of node i in G. There are two terms in this utility function. The first term 
specifies the net benefit to node i from its immediate neighbors. The second term specifies the sum of bridging benefits to node 
i. Here 1 — -pfc is the fraction of pairs of neighbors of node i that are non-neighbors and di normalizes the level of bridging 

benefits that node i gains in the network. For example, the fraction of pairs of neighbors of node 1 that are non-neighbors in 
both gl and g3 in Figure Q] is 1.0. However the degree of node 1 in gl is d\ — 5 and the degree of node 1 in g3 is d\ = 2. 
The normalization term di ensures that the bridging benefit for node 1 is higher in gl than in g3. Note that the bridging benefit 
of our proposed model can also be altered by introducing an arbitrary increasing, real-valued function of di (call it f{di)). In 
this case, the utility model (Equation [TJ becomes as follows: 

Ui{G) = d t {8 ~c) + f(di) (l - S 2 . 

For ease of analysis, we work with f(di) = di throughout this paper. 

Note: Assume that node i bridges the communication between j and k; and a benefit of S 2 is generated. In the literature, 
there are three well known ways of distributing the benefit S 2 to nodes i, j, and k: (i) only node i gets entire S 2 , (ii) node i 
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Figure 1: An illustrative example 

gets 0, and (iii) nodes i, j, and k get equal share of 8 2 . In this paper, we work with scenario (i). A similar approach is utilized 
in l38t) as well. We note that the analysis that we perform using scenario (i) can be easily extended to other two scenarios. 

A. The Network Formation Game 

The above framework defines a strategic form game T = LN, (S l j)i 6 jv , (wj)igjvl mat models network formation with 
localized payoffs. We refer to this as network formation game with localized payoffs (NFLP). The following example illustrates 
NFLP. 

Example 1: Assume that = {1, 2, 3, 4, 5, 6} is the set of 6 players. If s% — {2, 3, 4, 5, 6}, s 2 — {1}, S3 = {1}, S4 = {1}, 
S5 = {1}, sq = {1}, then the resultant graph gl is the star graph as shown in Figure [T](i). Note that an edge forms with the 
consent of both the nodes. 

Following the NFLP model, the payoffs of the players in the star graph are as follows: m(gl) = 5(6 — c) + 55 2 and 
u 2 (gl) = u 3 (gl) = 04(51) = 145(51) = "6(5!) = ($- c). 

If si = {2, 3, 4, 5, 6}, s 2 = {1, 3, 6}, s 3 = {1, 2, 4}, s 4 = {1, 3, 5}, s 5 = {1, 4, 6}, s 6 = {1, 2, 5}, then the resultant graph 
g2 is the wheel graph as shown in Figure [T](ii). Following the NFLP model, the payoffs of the players in the wheel graph are 
as follows: 1*1(52) = 5(6 - c) + ^- and u 2 (g2) = u 3 (g2) = u 4 (g2) = u 5 (g2) = u 6 (g2) = 3(6 - c) + 6 2 . 

On similar lines, if s% = {2, 6}, s 2 = {1, 3}, S3 = {2, 4}, S4 = {3, 5}, S5 = {4, 6}, sq — {1, 5}, then the resultant graph g3 
is the cycle graph as shown in Figure Q] (iii). Following the NFLP model, the payoffs of the players in the cycle graph are as 
follows: ui(gZ) = u 2 (g3) = u 3 (g3) = u 4 (g3) = u 5 (g3) = u 6 (g3) = 2(6 - c) + 26 2 . 



III. Analytical Deductions on Topologies of Pairwise Stable Networks 

In this section, we first recall the notion of pairwise stability. Then, we characterize the topologies of pairwise stable networks. 
To begin with, we note that the notion of pairwise stability is defined by Jackson and Wolinsky (1331) . Formally, we call an 
undirected graph G = (V,E) pairwise stable (1331) if (i) V(i,j) 6 E,m(G) > Ui(G — (i,j)) and Uj(G) > uj(G ~ (i,j)), (ii) 
V(i,j) i E, if m(G) < Ui (G+ (i,j)) then u 3 (G) > u,(G + (i,j)). 

We now focus on characterizing the topologies of the pairwise stable networks that may emerge following the framework in 
NFLP. Characteri zing pairwise stable networks under various network formation models has been addressed in the literature 



d!9|). (|20|). d42h . d43b, (|38b, (|36|), OO), OJJ), OJ, (|29|), (|49D, ([50). In our approach, we consider the topologies of certain 
standard networks (such as complete network, cycle network, star network, multi-partite networks) and then study whether 
such topologies are pairwise stable following the framework of NFLP. We now present few results to establish certain standard 
networks are pairwise stable in the framework of NFLP. 

Proposition 1: If (6 — c) < 6 2 and (c — 6) < 6 2 , then the complete bipartite network is pairwise stable. 
Proof: 

Consider a complete bipartite network, G, with a\ and a 2 nodes respectively in the two partitions. The utility of node i in 
a partition with a\ nodes is Ui(G) = a 2 (6 — c) + a 2 6 2 . This proposition can be proved in two steps. 

Step 1: Let us now add the edge (i,j) to G and call the resultant graph G. It can be readily checked that Ui (G) = (a 2 + 1) (5 — 
c) + (a 2 — l)5 2 . Since we are given that 5 2 > (5— c), we get that Ui(G) = a 2 (5 — c)+a 2 6 2 > (a 2 + l)(8— c) + (a 2 — 1)6 2 =Ui(G). 
That is, no pair of non-neighbor nodes is better off by forming a link in G. 

Step 2: Assume that node i severs an edge in G and call the resultant graph G. It can be shown that Ui(G) = (a 2 — 1)(S — 
c) + (a 2 — 1)6 2 . Since we are given that 6 2 > (6 — c), it is immediately seen that Ui(G) > Ui(G). Node i is not better off by 
severing a link in G. 

Note that we can apply similar analysis with respect to each node in the other partition. Hence the complete bipartite network 
is pairwise stable. ■ 

Proposition 2: (a) The complete network is pairwise stable if (c — 6) < (b) The cycle network is pairwise stable if 
1 < (c - 6)/6 2 < 2, (c) The null (empty) network is pairwise stable if (6 — c) < 0. 
The result can be proved easily by using arguments similar to that in Proposition Q] 
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Proposition 3: For fc > 3, the complete fc-partite network is pairwise stable if (i) 6 = c, and (ii) <n = a, Vi 6 {1,2, ...,fc} 
where dj is the number of nodes in partition i in fc-partite network and a is any positive integer. 

Proof: We start with a fc-partite graph, G, satisfying condition (ii) given in the statement of this proposition. Consider a 
node i in the p th partition of G where 1 < p < fc. We construct the proof in two steps. 

Step 1 (edge addition): We can see that, in G, the only link that can be added from node i is to a node j in the p th partition. 
Let G be the network obtained after a new link is added to G. For pairwise stability, we need Ui(G) — Ui(G) < 0. This 
implies 



where cr^ is the number of links among the neighbours of node i in G and Ui is the number of links among the neighbours of 
node i in G. Note that di = dj since nodes i and j belong to the same partition in G. Now we get that a i = oi + dj = o~i + di. 
Simplifying, we get 

um-u i {G) = (8-o)-^ + ^[^^ (2) 

Since the term ' — - lies in the interval [0, 1] and the fact that 5 — c (given in the statement of this proposition), we get 

cLidi — 1) 

that expression (fjf is non-positive. This implies that no pair of nodes can form a link to improve their respective payoffs. 

Step 2 (edge deletion): In G, consider that node i deletes a link to a node j in the q th partition where 1 < q < k and p^g. 
Let G be the network obtained after the link (i, j) has been deleted from G. For pairwise stability, we need Ui(G) — Ui(G) < 0. 
This implies 



-(6 -c) + (d i - l)Wl--^ 



where a i denotes the number of links among the neighbours of node i in G. We can see that a i = a. L — dj + a L . Simplifying, 

- (i - e) -^ r^rr°> + jg-),o 



expri 

Claim: expri < L 

Proof of the Claim: We know that di — 2~2j^i a j ■ Now, we derive an expression for crj. 



(4) 



Now, we show that expri < 1. The proof is by contradiction. Suppose expri > L 

> 1 



/ —2<7i + 2dj — 2a,; 2ffi 
V d t - 2 + d, - L 

2(dj - a, - a,-)(di - 1) + (2o- l )(d i - 2) > - 2)(d l - 1) 

- 2a, - 2a t d t - 2dj + 2a,) > (dj - Mi + 2) (5) 

From condition (2) in Proposition [3] we have a,; — l,Vi and = dj = (fc — l)a. Also, using Equation (0]) in Equation (|5]l 
and simplifying, we have 

(fc+ l)a- (fc- l)a 2 > 2 (6) 
=> (fc + l)a > 2 + (fc - l)a 2 > (fc - l)a 2 

'fc + A 



a < 



1 



Let y(k) — (f^j). As we know that the function y(k) is a decreasing function of fc (as derivative of y(k) with respect to fc 
is < 0), we can write 

a < y(2) a < 3 

So, clearly we can conclude that expri > 1 for < a < 3 (i.e., a = 2 and a = 1) and expri < 1 for a > 3. 
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Now we will examine what happens when a = 1 and a = 2. Substituting a = 1 in Equation © and simplifying, we get 
2 > 2 which is absurd. Substituting a = 2 in Equation (O and simplifying, we get k < 2 which violates the hypothesis that 
fe > 3. Hence, by the above arguments, expri < l,Va € {1, 2, ...}, Vfc > 3. This completes the proof of the claim. 

Note that we are given that 5 = c. Thus, from Equation (f3j), 



-S 2 



-2<Ji + 2cL 



2a,- 



2ct, 



< 



tti(G)-Ui(G) <0 



Thus, node i does not have any incentive to add an edge to G or delete an edge from G when the conditions given in the 
statement of the proposition are satisfied. As node i is chosen arbitrarily from G, we have that G is pairwise stable. ■ 

Using a similar approach, we can prove the stability results for other standard networks. We summarize these results in 
Table Q}\ and the graphical illustration of these results is depicted in Figure [2] 
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C.B.P 



5 C.E.K.P: Complete Equi /f-Partite 
6 C.E.T.P: Complete Equi Tri-Partite 

Table I: Characterization of pairwise stable 
network topologies in the proposed utility model 



Pairwise Stability Regions 
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cost (c) 

Figure 2: Graphical Illustration 



IV. Simulation: Validation and Additional Insights on Topologies 

In this section, we investigate various aspects of the network formation game through extensive simulations. The main 
purpose of this exercise is to get a better understanding of the network formation process as theoretical analysis has limited 
scope in enabling the understanding of the cumulative effects of many of the parameters like the initial network density, 
cost-benefit values, scheduling order of the nodes, etc that influence the network formation process. 

In the network formation process, starting from some initial configuration of a network, the resultant topology of pairwise 
stable network may not be any of the standard networks considered in the previous section. In other words, these simulation 
results reveal that there could exist certain other topologies that satisfy pairwise stability apart from these standard networks. 

Starting with some initial network (the null network, for example), the network structure changes with time as various nodes 
in the network add or remove links to their neighbors, so as to maximize their own individual utility from the network. It 
would be interesting to determine if, in the long run, the network reaches a stable state (an equilibrium or a near-equilibrium 
state). If the network does reach a stable state, it would be interesting to know the structure (i.e. shape) of the stable network 
and if this stable network is unique. One way of approaching this is to start with the initial network and model the dynamics 
of the system as a function of time (or an analogous parameter) and analytically study the asymptotic network structure in 
the limit as time tends to infinity. However, the dynamics of the system can become very complex even in a moderately sized 
network, making such an approach infeasible. Further, such results would only be valid for those particular initial networks. 

Another approach is to analyze the stability of some of the standard networks (complete network, cycle network, star network 
etc.) under our utility model (as presented in Table H). It would then mean that if the network reaches any of these standard 
stable networks, it is guaranteed to not deviate from this network. However, one problem with this approach is that starting 
from some initial network, we may not reach any of these standard networks. That is, some non-standard networks could be 
stable and the dynamic network could emerge into one of these non-standard networks. 

'Note that the legends in the figure correspond to the numbering specified in Table U 
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A. Simulation Setup 

We built a custom simulator using the C++ programming language in order to model the network formation process under 
our proposed network model. To implement the standard graph routines, we used the BOOST C++ libraries (54) which has 
efficient implementations of fundamental graph data structures and routines. We start with a random initial network consisting 
of n nodes. The number of edges between these nodes is determined by the parameter density (j). For example, if 7 = 0, 
we start with an empty network; if 7 = 0.35, we start with a network that contains 35% of the possible (™) edges. These 
edges are chosen uniformly at random. As noted in Section [II] a node obtains a benefit of 5 (0 < 5 < 1) and incurs a cost (c 
(0 < c < 1)) for maintaining a direct relationship (represented by an edge) with another node. In addition, each node reaps 
additional indirect benefit because of its potential to bridge its unconnected neighbors (determined by sparsity of relationships 
among his neighbors). 

B. The Simulation Process 

We run the simulations for each combination of possible values of S and c as shown in Table [II] given below. A single 
simulation run refers to a simulation with a particular value of of 5 and c. Further, each simulation run is repeated multiple 
times as per the Num-Repetitions parameter. We now describe the details of a single simulation run below. 

In a particular simulation run, each node is given an opportunity to act, based on a random schedule. Each node, when 
scheduled, considers three actions - namely, add an edge to a node that it is not directly connected to, delete an existing edge 
to a node, or do nothing. Each node chooses the action that maximizes its individual payoff (which is based on the parameters 
5 and c), breaking ties randomly. Node i, when adding an edge to node j, may be allowed to do so only if it is beneficial to 
both or if node j is at least not worse off (mutual add (MA)). Similarly, node i, when deleting an existing edge to node j, 
may be allowed to do so unilaterally (unilateral delete). We study pairwise stable network evolution under these conditions. 

Table [II] lists the various simulation parameters. At some stage in the simulation, the network could evolve into a stable 
state where no node has any incentive to modify the network. One iteration in which no node modifies the network is an idle 
iteration, and the parameter Num-Idle-Terminate indicates the number of idle iterations before we conclude that the network 
has reached a stable state. This is the case of normal termination of a simulation run. However, there may be cases where the 
network does not emerge into a stable state and cycles through previously visited states even after many iterations (the case 
of dynamic-equilibrium as noted in Hummon (1281) 1. The parameter Max-Iterations indicates the number of iterations before 
we forcibly terminate the simulation run. However, we have observed that all the simulation runs achieved convergence much 
before the maximum iterations allowed indicating that the formation of dynamic equilibrium is not possible in our utility 
model. However, we leave the formal proof of this observation as a future work. The parameter Num-Repetitions indicates the 
number of times each simulation run was repeated. The simulations were averaged out over different initial conditions and 
random schedules. 



Parameters 


Values 


N 


3, 4, 5, 10, 20 


Cost (c) 


0.05 to 1, in steps of 0.05 


Benefit (<5) 


0.05 to 1, in steps of 0.05 


Density (7) 


0, 0.35, 0.7 


Experiment 


Mutual-Add, Unilateral-Delete 


Num-Iterations 


1000 


Num-Repetitions 


100 


Num-Idle-Terminate 


30 



Table II: Simulation parameters and Values 




Figure 3: A stylized 5-node network 



C. Metrics Recorded 

At the end of Num-Repetitions number of repetitions, a number of metrics were recorded. The following lists some of the 
important metrics recorded. 

1) The network structure (shape) for each repetition 

2) The frequency with which each of the network structures in Section IIV-DI resulted (across all repetitions) 

3) The mean utility of the final network (across all repetitions) 

4) The mean time to reach the final network (across all repetitions) 

5) The mean number of acts to reach the final network (across all repetitions) 

Before we present the results, we briefly describe the classification criteria used to identify pairwise stable networks. 
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D. Classification of Pairwise Stable Network Structures 

Once the network reaches a stable state, we classify the network structure as one of the network structures shown in Table Hill 
As in Hummon (28), we use the sorted (descending order) degree vector to characterize the structure of the stable network. 
For example, the Null network has a sorted degree vector of (0, 0, ... , 0), the Star network (n-1, 1, 1, . . . , 1) and the Complete 
network (n-1, n-1, . . . , n-1). We refer to a network structure a shared network if it is a regular network (i.e., all nodes have 
same degree) of some uniform degree. For example, a cycle is a 2-regular graph and hence is a shared network. 

Also as in Hummon (1280 . we use total mean squared deviation (MSD) to classify the resultant stable network as Near- 
"standard network" (for example, Near-complete network). Further, if the mean squared deviation is above a certain threshold 
Or) then we know its not close to any of the above topologies, we then color the graph using a greedy coloring algorithm 
d54l) and then classify it either as a general k-partite graph (where k equals the number of colors required to color the graph) 
or any of the other network structures shown in Table Hill In our simulations, we use the maximum deviation ((n — l) 2 ) for 
calculating the t, i.e., r = 0.1 x (n — l) 2 . 

Note that whenever we classify a network as any type of K-Partite network, we implicitly mean that K > 3. The case 
of K = 2 is the same as bipartite network and is handled as a separately as shown in Table [III] Turan network refers to a 
complete bipartite network with the sizes of the two partitions to be as equal as possible. If N is even, then the Turan network 
has equal sized partitions whereas if N is odd, the size of one partition is one less than the other partition. 

For classification of a sorted degree network as a near-shared network, we first need to calculate the order of the regular 
network with which this degree vector needs to be compared. As in Hummon (28), to compute the total mean squared deviation 
for the shared structure, the ideal order is defined by average number of ties in the in-out degree vector, rounded to the nearest 
whole tie. In this example, if the degree vector is (3,2,1,1,1), the average is 1.6, and the ideal type shared structure is (2,2,2,2,2). 
However, note that a cycle network is necessarily a shared network but a shared network need not always be a cycle network. 



NULL 


STAR 


SHARED 


COMPLETE 


NEAR-NULL 


NEAR-STAR 


NEAR-SHARED 


NEAR-COMPLETE 


BI-PARTITITE-COMPLETE 


TURAN 


EQUI-K-PARTITE-COMPLETE 


EQUI-K-PARTITE 


K-PARTITE-COMPLETE 


K-PARTITE 







Table III: Possible Network Structures considered in the simulations 



The following example clarifies this procedure: Consider the 5-node network as shown in Figure [3] Suppose that we would 
like to classify this network as one of the following standard networks : Null, Star, Shared, Complete, Near-Null, Near-Star, 
Near-Shared or Near-Complete. This is done as follows. Note that the given network does not classify as any of the first four 
networks in the list given above. Hence, we try to classify the given network as one of the remaining four networks (i.e., the 
'near' type networks). 

We know that the sorted degree vector is (4,3,3,2,2) for the given network. The ideal order for the shared network 
comparison is calculated by taking the average degree (which is 2.8) and rounding to the nearest integer (which gives 3). This 
means we have to compare the network to a 3-regular network. The total MSD from the shared network is thus ((4 — 3) 2 + 
(3 - 3) 2 + (3 - 3) 2 + (2 - 3) 2 + (2 - 3) 2 ))/5 = 0.6. The total MSD of this network from Star network is ((4 - 4) 2 + (3 - 
l) 2 + (3 - l) 2 + (2 - l) 2 + (2 - l) 2 ))/5 = 2. Similarly, the total MSD from Null network is 8.4, and the total MSD from the 
Complete Network is 2. The value 0.6 being the least among these and less than 10% of maximum deviation 16, we classify 
the above network structure as Near-Shared. 

E. Multiple Classification of Pairwise Stable Structures 

We note that the classification of pairwise stable network structures according to Table iHll is not mutually exclusive. There 
can exist networks which can be classified as more than one of the types described in Table [III] We illustrate a couple of 
interesting network structures that we encountered during our simulations here. Figure Ufa) refers to a pairwise stable network 
that emerged when we ran the simulation with random_seed = 6875, 5 — 0.7, c = 0.55. We observed that this network is both 
a Near-Shared network as well as a Tri-partite complete network whose parititions are (0, 6, 7, 8), (1, 2, 5), (3, 4, 9). In such 
cases, we classify the network structure as a K-Partite Complete network. 

Another example is shown in Figure |4jb) which is obtained when running simulations with random_seed — 15256, S 
(1.5. i = 0.5. We observe that this graph can be classified as a regular (or Shared) network with degree=5. However, it turns 
out that this graph is also an equi-partitioned bipartite network with partitions (0, 3, 4, 8, 9), (1, 2, 5, 6, 7). In such cases, we 
classify the graph as equi-bipartite network (or the Turan network). 

F. Interpretation of Pairwise Stability 

In a pairwise stable network, if a node adds a link to another node and gains strictly from it, the other node should lose 
strictly. Hence, the addition of the link becomes infeasible in this case. However, nodes in a pairwise stable network can still 
add links if adding these links does not change the payoffs of either of the nodes. In this case, the nodes are indifferent about 
adding the link. In the case of deletion, a node will delete a link from the current network unilaterally if it strictly benefits 
from doing so. We use this interpretation of pairwise stability during the course of our simulations. 
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(a) (b) 

Figure 4: Possibility of multiple classifications for a given network structure 



G. Model Validation 

We now proceed to understand some of the results of our simulations. First, in this section, we focus on the validation of 
our theoretical results on pairwise stability as shown in Figure [5] We are interested in knowing the following aspects in the 
simulations. 

« Do the pairwise stable networks identified in Table Q] actually emerge in the simulation process? 
« If so, under what values of 8 and c do they emerge? 
« Do the conditions match with the theoretical results? 

We conducted our simulations for all combinations of 8 and c as explained before. Figure |3a)-Figure |5jr) validate the 
analytical results derived in Table U . The vertical axis of each plot in Figure [5] is the benefit value (8), ranging from to 1, 
and the horizontal axis represents the cost parameter (c), ranging from to 1, In general, given a particular value of S and c, 
there may be multiple network structures that may be pairwise stable. The type of network structure emerging in the network 
formation process depends on a number of factors like the initial network, the scheduling order of the nodes along with the 
parameters of 8 and c. Hence, we run each simulation run Num- Repetitions times each time starting with random schedules 
and starting with different initial networks with the hope of getting all possible pairwise stable networks. In particular, we start 
with three different initial networks with densities (0,0.35,0.7) respectively as shown in Table [Hi 

We plot the pairwise stable regions for different networks namely bipartite complete network, null network, complete network, 
etc and compare with the theoretical predictions. Figure [5ja)-(d) show theoretical results and Figure |5je)-(r) show the results 
from the simulations. 

Figure |5je) shows the regions where the Bipartite Complete (BPC) network emerged as one of the pairwise stable network 
when the simulation run was started with number of nodes (N = 10) and initial network with density(7 = 0). Clearly, we can 
see that BPC does not emerge as pairwise stable in the regions where 8 < c as the null network (which coincides with the 
initial network) is also pairwise stable and the nodes prefer not to add any links to the initial network. However, Figure |3f) and 
Figure |3g) show that if the starting network is already having some existing links then nodes try to form BPC network even 
in the regions where 8 < c. This shows the importance of the initial network in the network formation process. Figure |5fh) is 
obtained by merging all the regions of Figure [3e)-(g) and this closely corresponds to the theoretical predictions of BPC stability 
shown in Figure [2 a). Figure |5ji)-(l) similarly show results for N = 20. In this case, however, we observe that Figure |5jl) is 
not as close to Figure [2 a) which is due to the fact that there may be many more pairwise stable topologies that may emerge 
as the number of nodes increase which illustrates a fundamental difficulty in characterizing all pairwise stable networks for 
every possible value of number of nodes (N), 

Another observation is that the complete network is theoretically proven to be the unique pairwise stable network in the 
region shown in Figure |3c). We can clearly see the simulation results in Figure |5Jh) and Figure |5jl) that this region is clearly 
excluded from the BPC stable region as starting with any initial network, only the complete graph emerges as unique the 
pairwise stable network in the region specified by Figure [3c). 

We similarly show the stability regions for complete and null networks in Figure Elm) and Figure [3o) respectively which 
corresponds to the theoretical predictions of Figure [5fb) and Figure P5Jd) respectively. As explained earlier, Figure |5jn) again 
illustrates the importance of initial network in making the null network as the pairwise stable network. 

As shown in Proposition [3] the equi-kpartite network is stable when 8 = c and Figure [3p) shows that indeed in this region, 
the equi-kpartite network does emerge as the pairwise stable network when N — 20. Proposition [3] was only a sufficient 
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Pairwise Stable(PS)-theoretical 
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N = 20, y. 
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NEAR-SHARED- PS (simulations) 
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0.5 

cost (c) 
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N = 1 0, T= 0.35 



0.5 1 0.5 1 

cost (c) cost (c) 
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(n) 



KPAR-COMP- PS (simulations) 
N = 20, y= 0, 0.35, 0.7 




Complete (COMP) Network 
Unique PS-theoretical 




BPC- PS (simulations) 
N = 10, 1=0.7 



(g) 



BPC- PS (simulations) 
N = 20, 1= 0.7 




NULL- PS (simulations) 
N = 20, 1= 0, 0.35, 0.7 




Null Network 
Pairwise Stable(PS)-theoretical 




a 0. 



BPC- PS (simulations) 
N = 1 0, y= 0,0.35, 0.7 



cost (c) 

(h) 

BPC- PS (simulations) 
N = 20, 1= 0, 0.35, 0.7 




0.5 1 

cost (c) 

(1) 

EQUI-KPAR-COMP- PS (simulations) 
N = 20, 1= 0, 0.35, 0.7 




(q) 

Figure 5: Validation of theoretical results through simulations [Repetitions = 100: for each (S, c) pair ] 



condition, we observe from the figure that there are other regions of S and c (which we have not analytically characterized) 
at which equi-kpartite network emerges as the pairwise stable network. 

As explained earlier, our characterization of pairwise stable network structures as shown in Table Q] is not exhaustive and 
hence, we used simulations to depict the region of stability for important types of network structures namely the near-shared 
network and k-partite complete network. We show the results in Figure (5jq) and Figure |5jr). 

H. Emergent Network Topologies During Simulations 

Figure [6] shows the simulation results for 10-node and 20-node networks. The exact parameter configurations and the initial 
network densities are marked in Figure [6] The vertical axis of each plot in Figure [6] is the benefit value (6), ranging from 
to 1, and the horizontal axis represents the cost parameter (c), ranging from to 1. As noted earlier, for a < c, 6 > pair, we 
repeat the simulation for Num-Repetitions. Each repetition for the simulation results in a network that can be classified as one 
of the structures mentioned in the theoretical analysis. We plot the most frequent (modal ) network structure as determined 
by the frequency with which each of the network structures resulted in Num-Repetitions simulation runs. The experiment was 
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N = 10, y=0 



N = 10,7=0.35 
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0.5 
cost (c) 



BIPARCOMP 
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N = 20, 7=0.35 



N = 20, 7=0.7 
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cost (c) 
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- KPARCOMP 



I 




0.5 
cost (c) 

Figure 6: Network topologies obtained during simulations 



BIPARCOMP 



KPARCOMP 



repeated starting with different network densities, 7 = 0, 0.35 and 0.7. We list some of the abbreviations used in the legends 
of the plots in Table [6] 

In each of the plots in Figure |6l we observe that the complete graph is the resultant pairwise stable network (when S > c, 
(5 — c) > 5 2 ) which concurs with the theoretical deductions that the complete graph is the unique pairwise stable network in 
this region (Table U and Figure [2c)). 

We can also infer from Figure|5ja), Figure|5jb) and Figure|5|d) that there is an overlap in the stability regions among complete 
and complete bipartite and also between null and complete bipartite networks. However, as observed through simulations 
(Figure |5), we see that the complete bipartite network emerges as the modal pairwise stable network in its regions of overlap 
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TUR_GRA 


Turan Graph 


B1PARCOMP 


Bipartite Complete 


NRSHARED 


Near-Shared 


KPARCOMP 


KPartite Complete 



Table IV: Some abbreviations used in Figure [6] 



with the aforementioned networks. This can be attributed to the fact there are a large number of possible bipartite graphs 
whereas there is only one null network and one complete network. Hence, the likelihood of the null and complete emerging 
in a region where the bipartite network is also pairwise stable, is small. 

We also observe from some of the plots in Figure [6] that Near-Shared and K-Partite Complete networks emerge as pairwise 
stable networks under some regions of the parameters. As explained in earlier sections, this can be attributed to the fact that 
our analytical results (as shown in Table [TJ) is not exhaustive and there exist some new topologies ( which we characterize as 
Near-Shared or K-Partite Complete networks) which are also pairwise stable. 

/. Network Evolution 

Having studied the macroscopic behaviour of our simulations, we investigate the network formation process from a mi- 
croscopic viewpoint. We examine various snapshots during the network formation process of a single simulation run which 
is repeated just once for a fixed parameter of 5 and c. We consider S = c = 0.5 as our parameter configuration. We can 
observe from the our proposed utility model (Equation [TJ that for this configuration the benefits from direct links is and so, 
nodes try to maximize the benefits due to bridging behavior. The nodes form/delete links such that they emerge as a bridge in 
connecting their unconnected neighbors. Hence, we would expect the final pairwise stable network to be consisting of nodes 
who are filling the positions of structural holes in the network. In other words, the emergent pairwise stable graph should be 
a triangle-free as nodes form links with nodes who are themselves are not connected with each other. 

We depict the snapshots of network formation process in Figure Q We can see that initially the nodes are forming links 
in such a way that triangles are not present but eventually triangles eventually do form due to the cumulative action of other 
nodes in the network. When triangles emerge in the neighbourhood of a node, it leads to deletion of links from that node 
(as the node will benefit strictly from deletion) and the final emergent network (Figure |7fl)) is a bipartite complete network 
(which is triangle-free) with alternate nodes in the ring layout depiction in Figure |7fl) belonging to the same partition. 

In complex network literature, the number of triangles in the network is a important parameter which was first studied by 
Watts and Strogatz (|5|) by definition the notion of clustering, sometimes also known as network transitivity. Clustering refers 
to the increased propensity of pairs of people to be acquainted with one another if they have another acquaintance in common. 
Watts and Strogatz define a clustering coefficient (denoted by C) that measures the degree of clustering in a undirected 
unweighted graph. 

3 x Number of triangles on the graph 

( = 

Number of connected triples of vertices 

The factor three accounts for the fact that each triangle can be seen as consisting of three different connected triples, one with 
each of the vertices as central vertex, and assures that < C < 1. A triangle is a set of three vertices with edges between 
each pair of vertices; a connected triple is a set of three vertices where each vertex can be reached from each other (directly 
or indirectly), i.e. two vertices must be adjacent to another vertex (the central vertex). 

It can be observed from the utility model proposed in equation ([TJ in Section UJJ that I — ^— I component in the utility model 

\ ( 2 ) / 

corresponds to the clustering coefficient of node i. Thus, in our utility model, nodes benefit from having lesser clustering 
coefficient as this will lead to the formation of structural holes, which in turn leads to increase in the payoff for the node. We 
elaborate more on this when we discuss efficient network topologies in Section [V] 

We now study how the clustering coefficient changes as the network evolves through the different phases shown in Figure [7] 
We plot this result in Figure 0a). We see that upto time epoch 50 clustering coefficient is 0. Later there is a increase in the 
value which is followed by the reduction in the clustering coefficient back to (at time epoch 150) when the pairwise stable 
network emerges. As explained before, this is indeed the expected behaviour during the network formation process for the 
parameters 6 = c = 0.5. 

We also study the average clustering co-efficient in all the pairwise stable networks that emerge for different values of 5 and 
c. We take the average over running Num- repetitions number of times. The result is shown in the 3d plot in Figure [8] (b). We 
can see that the clustering coefficient assumes value of 1 in the regions where the complete network is stable and when the 
null network is stable. In other regions, the clustering coefficient value is between and 1 which indicates a tradeoff between 
the benefits from direct links and the benefits from bridging benefits to the nodes in the network. 

J. Average Number of Actions before Convergence 

In this section, we will study the effect the initial network density has on the effort needed by the nodes to achieve 
convergence to a pairwise stable network. A single addition of an edge or a single deletion of an edge by a node is considered 
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Figure 7: Evolution of the network formation process (N — 20, 8 — 0.5, c = 0.5) 



to be a single 'act' by that player. We now study the mean number of acts performed by the players to converge to a pairwise 
stable network starting from various initial random networks. We can see from Figure |9ja) that the number of changes to 
the network is more when the 5 > c region and this is because the initial network is a null network and the players need to 
perform a lot more additions/deletions to the network before reaching the final stable network which is the complete network. 
When 5 < c, the players need not perform any change to the network as the initial null network is already pairwise stable. In 
fact, we can observe from the Figure [9] that the number of acts needed to reach the complete network is maximum (about 180) 
when starting with null network than when compared to other scenarios of 7 = 0.35 and 7 = 0.7 (mean acts is about 130). 
We observe a reversal of the work needed to reach null network in Figure |9]c) where more number of changes is needed 
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N = 20, y=0, 0.35, 0.7 
N=20, delta=0.5, c=0.5 




Figure 8: Study of Clustering Coefficient (N = 20) 




to reach null network than reaching the complete network. This can be attributed to the fact that the initial network is already 
a dense network to start with and it takes relatively less effort to reach the complete network than the null network under 
appropriate configurations of 5 and c. 

Initial network density of 0.35 corresponds to a medium-dense network (Figure |9jb)) and hence there is a non-zero effort to 
reach any of the pairwise stable network under any parameter configuration. However, as in Figure |9ja), it takes more effort 
for players to reach the complete network than the null network. 

V. Analytical Characterization of Topologies of Efficient Networks 

In this section, we study the structure of efficient networks, i.e., networks that maximize the overall utility, under various 
conditions of 5 and c. First, we begin by introducing a few useful classical results in extremal graph theory and we use these 
results later in our analysis. 

A. Triangles in a Graph 

If three nodes i, j, and k in G(V, E) are such that i and j, j and k, k and i are connected by edges, then we say that nodes 
k form a triangle in G. The number of triangles in a simple graph G plays a crucial role in the computation of payoffs to 
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the nodes and we state here some classical results. We know from Turan's theorem (1551) . that it is possible to have a triangle 
free graph if the following holds: 

T 



e < 



(7) 



Here e denotes the number of edges and n the number of vertices of the graph. Moreover, from (1561) . we know that the 
number of triangles, T, can be lower bounded, if the number of edges exceed the above value \Jj-\, by 



T > 



n(4e 



9 



(8) 



In what follows, we refer to the graph having maximum number of edges with no triangles as the Turan Graph and we 
represent it by Gruran- It is easy to verify that such a graph is a complete bipartite graph, and the the number of vertices in 
each partition differs at most by 1. 



B. Finding the Efficient Graph 

Definition 1 (Efficient Graph): The utility (u(G)) of a given network G is defined as the sum of payoffs of all the nodes 
in that network. That is, 

n 

u(G) = ^2ui(G). (9) 

»=i 

A graph that maximizes the above expression (i.e. sum of payoffs of nodes) is called an efficient graph. 

We now present a series of results on the topologies of efficient networks using the proposed framework. These results are 

based on different ranges for the values of 6 and c. 

Proposition 4: When 6 < c and S 2 < (c — 5), the null graph is the unique efficient graph. 

Proof: For any node i, d{ > implies that the utility of that node is negative thus reducing the overall network utility. 
This follows from (6 — c + 5 2 ) being negative. ■ 
Proposition 5: When 8 = c, the Turan graph is the unique efficient graph. 

Proof: We will analyze the efficiency of an arbitrary graph (denoted by G) as follows. 

n n / \ 

u(G)=5>(G)=5>5 2 l--$r 

i=l i=l \ \2 ) / 



i— 1 1=1 
n c2 n 

<5 2 Yd l - T ±—Y2a l 



(n-2) 



(2 x 3 x T 3 (G)) 



(10) 



where, T^(G) is the number of triangles in the graph G. The last step of the above simplification is due to the fact that the 
number of links among the neighbours of a node i is the number of triangles in the graph in which node i is one of the vertices 
of the triangle. The factor 3 in the last step is due to the fact that every triangle contributes to the cr^ of 3 nodes. We know 
that, for an efficient graph, Equation ( TTOb should be maximized and that happens when the number of triangles in a graph is 
minimized while simultaneously maximizing the number of edges in the graph. 

The Turan graph (refer Equation (0) is a graph with maximum edges that has no triangles. So an efficient graph must have 
an efficiency greater than or equal to that of a Turan graph. Thus, it is clear that there is_no need to consider graphs with 
edges lesser than that of a Turan graph. Let us consider the case when a graph (denoted by G) has more edges than the Turan 
graph. Let G have [^J + x e dges where x > 0. From Equation ( flOl l, we know that 



u(G) 



^>(G) 

i=i 



+ x 



E 

i=l 

5 2 



2a z 



(n-2) 

where Ts(G) is the number of triangles in G. From Equation ([SJ, we have 

5 2 



{di - 1) 
(6T 3 (G)) 



u{G) < <T 



+ x 



(n-2) 
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4e 



(11) 



(12) 
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Since T 3 (GTuran) = 0, the efficiency of the Turan graph is: 

u(GTuran) = ^ ' Uj(GTuran) 

The change in efficiency (Au) between the two graphs is 

Au = u(G) - u(G T uran) < 2S 2 [ X 



^2 x 


n 2 






) 




_ 4 _ 





4x 



(13) 



(14) 



(n-2) 3 

which is clearly negative for any x > 0. This implies that the Turan graph is the unique efficient graph. ■ 
Proposition 6: When 6 < c and 5 2 > (c — S), the Turan graph is the unique efficient graph. 

Proof: We prove this by contradiction. Assume that G is any graph other than the Turan graph and G is efficient. We 
show below that G cannot have lesser number of edges than G turan, 



i[G) = J2 <G) = (6- c )J2d t + J2 dr5 2 1 * 

2=1 2=1 2=1 \ \ 2 , 

n 

2=1 

n 

) whenever, ^ d, < 2 - 



i=l 



And observe, if G has same number of edges as Gturan and is different from it, it can contain triangles and will have an 
utility less than that of Gturan, as the benefit from bridging would go down and the benefit from direct links would remain 
unchanged. 

Thus G contains more edges than Gturan- Observe, that the benefit from direct links is negative (8 — c) Y^i=o di < 0> an d 
G has an higher utility compared to that of Gturan- It has to be that the bridging benefits in G has to be greater than that of 
the Turan graph, as the utility due to direct links term has become more negative compared to its value in Gturan 



lb I I V I 

c(G) = J2 u i (G) = (<S-c)£ d * + E d>6 2 1 - -g- 

i=l i=\ i=l \ V 2 / . 



negative 



utility more than G7 



This implies that this graph would give a higher utility for the <5 = c case, as the first term is there. This contradicts 
Theorem [5] and so our assumption must be wrong. Hence the Turan graph is efficient. ■ 



Parameter Range 


Efficient Topologies 


8 < c and 8' z < (c - 8) 


Null network 


8 < c and 8' A > (c - 8) 


Turan network 


S = c 


Turan network 


8 > c and 8' 2 > 3(8 - c) 


Turan network 


8 > c and (8 - c) > 2<5^ 


Complete network 



Table V: Characterization of Topologies of Efficient Networks in NFLP 



Proposition 7: When 5 > c and 5 2 > 3(6 — c), the Turan graph is the unique efficient graph. 

Proof: Let G be the efficient graph. Using a similar analysis that lead to Equation (TT2l . we can see that 

,2 



u(G) <(S + c + S") 
= (6 + c + 6 2 ) 



+ x 



+ x 



For the Turan graph, it can also be seen by simple analysis that 



uiGruran) 



(n-2) 
(n-2) 



(S-c + S 2 ) 



4e — nf 
9 



(15) 



u(G) - ti(G T „™„) < 2x (5 - c + 5 2 ) 



AnS 2 



3(n - 2) 



<2x[ {8-c+S 2 )-^- 



(16) 
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Thus, when 8 2 > 3(8 — c), the Turan graph is the unique efficient graph. ■ 
Proposition 8: When 8 > c and (8 — c) > 2<5 2 , the complete graph is the efficient graph. 

Proof: It can be shown that starting with an arbitrary graph G (which is not a complete graph), adding an edge between 
two nodes i and j (with smallest degree) increases the cumulative utility of these two nodes by at least 28 2 . At the same time, 
there is a decrease in utility of a common neighbour of nodes i and j, say node k, as there is a decrease in the bridging benefits 

2S 2 

of node k. It can be shown that the cumulative decrease in utility of all such common neighbours formed is — min(di,dj) 

d k - 1 

which is less than equal to 28 . Repeating the above process, we get the complete network. ■ 
Conjecture 1: When 8 > c and (5 — c) < 8 2 < 3(8 — c), the Turan graph is the efficient graph. 
Conjecture 2: When 8 > c and (S - c) < 28 2 : 

(i) if (8 — c) > ^r^S 2 , then the complete graph is the efficient graph. 

(ii) if (5 — c) < ^2<5 2 , then the Turan graph is the efficient graph. 
We summarize the above results on efficiency in Table [V] 



VI. Price of Stability (PoS) of the Proposed Model 



Recall that PoS d35l) is the ratio of the sum of payoffs of the players in a best pairwise stable network to that of an efficient 
network. In NFLP, a best pairwise stable network means a pairwise stable network with a maximum value of the sum of 
payoffs of the players. By invoking the results derived in the previous sections, we now present our results on PoS for the 
proposed model. 

Theorem 1: The price of stability (PoS) is 1 in each of the following scenarios: 

(i) 8 > c and (8 - c) > 28 2 , 

(ii) 8 > c, S 2 > (8 - c) and 8 2 > 3(8 - c), 
(iu)S = c, 

(iv) 8 < c and S 2 > (c - 8). 

This theorem can be proved easily using the results summarized in Table J] and Table [V] 
Note: Since the null network is the only efficient network when 8 < c and 8 2 < (c — 8), PoS is not defined in this region. 

In view of Conjecture Q] the following result presents bounds on PoS. 

2 ^ o/f „\ n„c ^ 1 



Proposition 9: When 8 > c and (8 - c) < <r < 3(8 - c), PoS > 



2 ' 



Proof: We know that, under the conditions 8 > c and (8 — c) < 8 2 < 3(8 — c), the pairwise stable graph with the highest 
utility is the Turan graph (as seen from Table JJ. Let Conjecture Q] be false._In_this scenario, let us denote the efficient graph 
by G. We will now evaluate an upper bound on the maximum efficiency of G. G has to have more direct links than the Turan 



graph (as 8 > c) to be a candidate for efficient graph. Let G have 



edges where x > 0. 



n n n / 

i(G) =£iii(5) = (S - c)^d, + Y J d i 5 2 1- <T 

i=l i = l 1=1 \ 



i d i) 



di 



Since di can be at most (n — 1), 



t (G)<(5-c + 5 2 )n(n-l)- ( 



u(G) <(S-c + 8 2 )n(n - 1) - I ^— ) T 3 (G) 

, n — 



26 



By Equation ®, we have 



"(G) < {5 - c + 5 2 )n(n~- 1) 
= (5 -c + 5 2 )n(n - 1) 



25 2 \ (n{±e 



n - 2 



, S 2 n \ /8x\ 

Since — > 0, we have 

\n-2 I 9 1 



i(G) < (5-c + (5 2 )n(n-l) 
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The Turan graph is pairwise stable under these conditions (refer Table H). Hence we get the following: 



u(G T uran) = (S - C + 5 ) 2 





n 2 














) 




_ 4 _ 





PoS > "( Gr Jj™n) > 



(S-c + 5 2 ) 



_ 1 1 

~~ 2 + 2n 



u(G) " (<5-c + <5 2 )n(n- 1) 

This implies that PoS > i. ■ 
Remark: In view of Conjecture |2j it can be noted that a similar bound can be obtained in the region 6 > c and (5 — c) < 25 2 . 
The details are not provided here due to space constraints. 

From Theorem Q] and Theorem [9] along with the simulation results, we conclude that, under mild conditions, the proposed 
NFLP produces efficient networks that are pairwise stable. This is desirable from the view of system design. 



VII. Conclusions and Future Work 

In this paper, we proposed a network formation game with localized payoffs (NFLP) and studied the topologies of pairwise 
stable and efficient networks. We gained additional insights about the network formation process through detailed simulations. 
We also studied the tradeoff between pairwise stability and efficiency using the notion of PoS. In particular, we computed the 
PoS of the proposed NFLP. Except for a few configurations of S and c, we have shown that PoS is 1. This means that, under 
mild conditions, that NFLP produces efficient networks that are pairwise stable. 

In the utility function we defined in Section [HI the payoff of any node had two components - benefit from direct links and 
benefit from bridging. The pairwise stable network topologies of our model (Section ITTTb shows that there are no bridges in 
the equilibrium networks. Bridges can also be considered as bottlenecks of information flow. Since every node is striving to 
obtain a bridging position there are no bridges in the equilibrium networks, this suggests that the proposed utility model avoids 
bottlenecks in decentralized network formation. Here are a few pointers for future work. First, the framework in this paper can 
be extended to the case of directed graphs and weighed graphs. This involves certain challenges such as defining the utility 
model appropriately. Second, the setting in this paper can be extended by varying the notions of stability and efficiency. We 
note that there are several possible notions of stability and efficiency that exist in the literature. The choice of an appropriate 
notion of stability as well as efficiency is a topic of debate. 

Further, our model gives us some valuable hints at the networks formed in real world as well. Some noted work in complex 
network literature has observed the emergence of bipartite graphs in real world scenarios An important example has 

been the class of collaboration networks. It has been observed that the network of actors basically is a uni-mode bipartite 
graph (57). Other important examples of real world bipartite networks include boards of directors of companies, co-ownership 
networks of companies and collaboration networks of scientists and movie actors. In the analysis of our proposed model in 
this paper, we have seen the emergence of important graph structures like the Turan graph and in general, bipartite graphs 
and fc -partite graphs during the network formation process under many configurations. Though our model does not precisely 
solve the difficult problem of identification of all parameters affecting network formation, it nevertheless offers valuable hints 
about some of the important parameters affecting real world network formation. The studies on our utility model of network 
formation also offers strong evidence that incorporation of important game theoretic concepts like pairwise stability is vital to 
the understanding of complex network formation behaviour. 

It is the goal of our future work to expand the horizon of our understanding of other class of real world networks namely 
the Internet (or the world wide web), epidemic networks, friendship networks, power grid networks, etc, and propose suitable 
strategic complex network formation models that, at least, approximately imitate the formation behaviour of some of these 
important real world networks. 
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